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Abstract. Many statistical applications require establishing central limit theorems for sums/integrals 
Sr(h) = J t€lT h(Xt)dt or for quadratic forms Qr{h) — J t sg/ ^ b(t — s)h(Xt, X s )dsdt, where X± is a 
stationary process. A particularly important case is that of Appell polynomials h(X t ) = P m (X t ), 
h(Xt,X 3 ) = P mi n(Xt,X a ), since the "Appell expansion rank" determines typically the type of central 
limit theorem satisfied by the functionals Sr(h), Qr(h). We review and extend here to multidimensional 
indices, along lines conjectured in [16], a functional analysis approach to this problem proposed by 
Avram and Brown (1989), based on the method of cumulants and on integrability assumptions in the 
spectral domain; several applications are presented as well. 

Resume. Nous considerons ici des theoremes de limite central pour des sommes/integrales Sr{h) = 
J j h(Xt)dt, et pour des formes quadratiques Qr(h) = J t sg/ b(t — s)h(Xt, X s )dsdt, ou Xt est un 
processus stationnaire. Un cas particulierement important est celui des polynomes dAppell h(Xt) = 
Pm(Xt), h(Xt,X 3 ) = P m ,n(Xt,X s ). Pour ce probleme, nous generalisons ici au cas des indices multidi- 
mensionnels une approche proposee par Avram et Brown (1989), basee sur la methode des cumulants 
et sur des hypotheses d'integrabilite dans le domaine spectral. Plusieurs applications illustrent la 
versatilite de l'approche. 
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Introduction 

Model. For a unified treatment of the discrete and the continuous (multi-dimensional) setups, we 
assume that Xt,t £ I is a real stationary random field, where I denotes a measurable group endowed 
with its Haar measure, and f tI v{dt) is integral with respect to the Haar measure. Depending on 
the continuous/discrete setup, I will be either M rf with Lebesgue measure, or Z rf with the counting 
measure. Discrete sums might be written either as integrals (in the statement of theorems), or in 
traditional sum notation (in the expository part). For continuous case we assume that X t ,t £ W 1 is a 
measurable random field. 

We will assume throughout the existence of all order cumulants cp.(ti,t2, ■••,£&) for our stationary 
in the strict sense random field Xt, which are supposed to be representable as Fourier transforms of 
"cumulant spectral densities" /&(Ai, Xk-l) S L\, i.e: 

Ck{h,t2, ■■■,tk) = Ck(h — tk, .., tfc_i — ifc, 0) = 
= / e < * lA 'fe- tfc Vfc(Ai,... J A fc _i)|i(dAi).../i(dA fc _i). 

JXi,...,X k -ieS 

Throughout, S will denote the "spectral" space of discrete/continuous processes and /J,(d\) will denote 
the associated Haar measure, i.e. Lebesgue measure normalized to unity on [— it, ir] d and Lebesgue 
measure on M. d , respectively. The functions /fc(Ai, Xk-l) are symmetric and generally complex 
valued for real field Xf. 

The random field Xt is observed on a sequence It of increasing dilations of a finite convex domain 
It, i.e. 

It = T I\, T — ► oo. 
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Correspondingly, we will consider linear and bilinear forms St and Qt, where summation/integration 
is performed over domains It- 

In the discrete-time case, the cases It = [l,T] d , T G Z + (in keeping with tradition) or It = 
[-T/2, T/2] d , T G 2Z + will be assumed. In the continuous case, we focus on rectangles It = {t G M rf : 
-T/2<U <T/2,i = l,...,d}. 

Later on we will see that the choice of a specific set I\ leads (when working in the spectral domain) 
to using an associated Dirichlet type kernel 

A T (A)= / j tx v{dt) (0.1) 
Jtei T 

and a multivariate Fejer kernel (|0.2p . Explicit well-known formulas are available when I\ is a rectangle 
or a ball, both for the discrete and continuous case. We will work in the paper with rectangular 
domains, however, some extensions are possible for balls by replacing corresponding estimates (see 
Appendix B). 

Motivation. Via the work of Szego, Schur, Wiener and Kolmogorov, the study of stationary one- 
dimensional discrete time series, defined by choosing I = Z, has been well developed and became 
tightly interwoven with several branches of mathematics, bringing forth jewels like for example the 
Wiener-Kolmogorov formula identifying the variance of the prediction error with respect to the past 
with the integral of the log of the spectral density. The extension to the continuous time case I = M, 
provided by Krein, exemplifies the subtle challenges brought by modifying the nature of the index set. 

The convenience of time series comes largely from the FARIMA family of parametric models, defined 
as solutions of equations 

<t>{B){l-B) d X t = e{B)£ t 

where B is the operator of backward translation in time, <j)(B), 0(B) are polynomials, d is a real number 
and £,t is white noise [51], [45], [50], [23], [73]. Using the FARIMA family of models, one attempts, 
via an extension of the Box-Jenkins methodology, to estimate the parameter d and the coefficients of 
the polynomials <p(B),9(B) such that the residuals £t display white noise behavior (and hence may be 
safely discarded for prediction purposes). 

An extension of this approach to continuous time and to multi-parameter processes (spatial sta- 
tistics) has been long time missing. Only recently, the FICARMA ( [28], [6]) and the Riesz-Bessel 
families of processes (which essentially replace the operator B by the Laplacian - see Appendix A), 
have allowed a similar approach for processes with continuous and multidimensional indices (at least 
in the rotationally invariant case). 

These examples illustrate the point that extension of ARIMA-type models to continuous and mul- 
tidimensional indices is an important challenge. 

Central limit theorems for quadratic forms. Asymptotic statistical theory, and in particular, 
estimation of the parameters of FARIMA and Riesz-Bessel processes, requires often establishing central 
limit theorems concerning 

T 

sums St = StQi) = h(Xi) 

i=l 
T T 

and bilinear forms Qt = Qt(^) = b(i — j) h(Xi,Xj). 

i=l j=l 
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of functions of stationary sequences Xi (and their generalizations to continuous and multidimensional 
indicies) . 

Notes: 1) The kernel of the quadratic form will be denoted by b(t), so that we may reserve 6(A) 
for its Fourier transform. 

2) The random field Xt, t £ It, will be allowed to have short-range or long-range dependence (that 
is, summable or non-summable correlations). 

A particularly important case is that of Appell polynomial^ /i(X t ) = P m (X t ), h(X t , X s ) = P m , n (X t , X s 
associated to the distribution of Xt, which are the building block of the so called "chaos/Fock expan- 
sions". Two main cases were distinguished, depending on whether the limit is Gaussian or not (the 
latter case being referred to as a non-central limit theorem). 

This line of research, initiated by Dobrushin and Major [32] and Taqqu [71] in the Gaussian case 
(see also Ivanov and Leonenko (1989) for Gaussian continuous case), by Giraitis and Surgailis [40], [44], 
and by Avram and Taqqu [11] in the linear case, continues to be of interest today [38], [42], [43], [60], 
[63], [67] to name only a few papers. 

Our interest here is in obtaining extensions to continuous and multi-parameter processes of the 
central limit theorem for sums and quadratic form, obtained in the case of discrete time series by 
Breuer and Major [27], by the method of moments. 

Some analytic tools. A key unifying role in our story is played by the multivariate Fejer kernels: 



n— 1 n—l 



*?■(«!, ...,«„_!) = (2 ^ (/i) ^_ 1)dT , A T (-^n e ) J] A T (n e ), (0.2) 

and their kernel property: the fact that when T —* oo, the multivariate Fejer kernel <& T con- 
vergence weakly to a 5 measure: 

Lemma 0.1. The kernel property: For any continuous bounded function C{u\, .., u n -i) } it holds 
that: 



„ n—i 

/ C(m,..,u„_i)^(ui,...,u n _i) TT /i(dui) = C(0,...,0) 

'is™- 1 , 



lim 



Proof: For the discrete onedimensional case we refer for example to [12], and for the continuous 
multidimensional case, with I\ a rectangle, to Bentkus [21], [22] or [8], Proposition 10. 

Developing some limit theory for multivariate Fejer kernels was the key point in several papers [12], 
[14], [13], [15] which generalized the Breuer and Major central limit theorem [27]. The papers above 
introduced a new mathematical object to be called "Fejer graph kernels" - see (|2.4p in section [2j 
which captures the common structure of several cumulant computations. Replacing the cyclic graph 
encountered in the specific case of quadratic forms in Gaussian random variables by an arbitrary 
graph, these papers reduce the central limit theorem for a large class of problems involving Appell 
polynomials in Gaussian or moving average summands to an application of three analytical tools: 
(1) Identifying the graphs involved by applying the well-known diagram formula for computing 
moments/cumulants of Wick products - see section lEl 



^For the definition of Appell polynomials see, for instance, Avram and Taqqu (1987), or Giraitis and Surgailis (1986) 
2 This convergence of measures may also be c 
(see Theorem IC.l \ , using estimates of the form 



2 This convergence of measures may also be derived as a consequence of the Holder- Young-Brascamp-Lieb inequality 



\\A T \\ s -i < k(s v )T d(1 - s ^ 

with optimally chosen s v , v = 1, • • • , V. 
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(2) Applying a generalization of a Grenander-Szego theorem on the trace of products of 
Toeplitz matrices to the Fejer graph integrals - see section [21 to obtain the asymptotic 
variance. This theorem is valid under some general integrability assumptions furnished by the 
Holder- Young-Brascamp-Lieb inequality. 

(3) The resolution of certain combinatorial graph optimization problems, specific to each applica- 
tion, which clarify the geometric structure of the polytope of valid integrability exponents on 
the functions involved (spectral density, kernel of the quadratic form, etc). 

Here, we observe that a similar approach works in the multidimensional and continuous indices 
case. More precisely, the only changes are a) the normalizations, which change from T to T d , and 
b) the condition for the validity of the Holder- Young-Brascamp-Lieb inequality (see Appendix C) in 
the continuous case. Therefore, the previously obtained central limit theorems continue to hold in 
the multidimensional case, including continuous indices, after simply adjusting the normalizations and 
integrability conditions. 

Statistical applications. The two cases most easy to study are that of Gaussian and linear 
processes. In the applications Section U] we will work assuming that Xt is a linear process (see (|4.ip or 
(14. 3ft below) . This assumption has the advantage of implying a product representation of the cumulant 
spectral densities - see for example Theorem 2.1 of [6]. Namely, for the cumulants we get the explicit 
formula 



Jsei j=1 

and in the spectral domain, we get 



k-l k-l k k 

/ t (A lr .,An)=4a(-^A,) Ha(\ i ) = d k l[a(\ l )5(Y J ^) (0-3) 

i=i i=i i=i j=i 

(the meaning of parameters d}~ and a function a(A) is clarified in section |4"TT]) . 
For k = 2, we will denote the spectral density by /(A) = /2(A) = d2d(\)a(— A). 

Definition 0.2. Let 

M^)H; rT ' :; r ^ «.>.j> 




denote the closure of the functions in L p which are continuous, bounded and of bounded support, 



under the L p norm. 

Note: In the torus case, this space intervenes in the proof of theorem 13. 1} which is established 
first for complex exponentials, and extended then to the Banach space of functions which may be ap- 
proximated arbitrarily close in L p (dfj,) sense by linear combinations of complex exponentials, endowed 
with the L p norm. 

Considering bilinear forms Qt we will work under integrability assumption: 
Assumption A: 

/ € L Pl (dfj,),b G L P3 (d/i), 1 <pi < 00, i = 1,2 

Note: 1) While a general stationary model is parameterized by a sequence of functions /^(Ai, Afc_i), 
k £ N, the linear model (|0,3p is considerably simpler, being parametrized by a single function a(A). 
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2) We expect all our results may be formulated directly in terms of characteristics of the field Xt, 
which suggests that the moving average assumption is probably unnecessary; indeed, more general 
results which make direct assumptions that functions /fc(Ai, \k-i) belong to some special L p -type 
spaces, have been obtained in certain cases - see, for example, [15]. 

Contents. We present a warmup example involving quadratic Gaussian forms in Section [TJ The 
results here are closely connected to those of the paper by Ginovian and Sahakyan (2007). We define 
the concept of Fejer graph and matroid integrals in Section [2j We will consider here only the first case 
(i.e. graphic matroids associated to the incidence matrix of a graph). 

Some limit theory (of Grenander-Szego type) for Fejer graph integrals is reviewed and extended 
to the continuous case in Section [3l Various estimates concerning kernels are collected in Appendix 
|B| and a particular case of the Holder- Young-Brascamp-Lieb inequality required here is presented in 
Appendix O 

In Section E] we introduce the linear model (which extends the Gaussian model) and develop several 
applications. Note here the existence of a different approach, due to Peligrad and Utev (2006), who 
established the central limit theorem for linear processes with discrete time and dependent innovations 
including martingale and mixingale type assumptions (see also the references therein for this line of 
investigation) . 

To make the paper self-contained we supply in Appendices the material we refer to in the main part 
of the paper. 

I. AN EXAMPLE: THE CENTRAL LIMIT THEOREM FOR GAUSSIAN BILINEAR FORMS 

We present first our method in the simplest case of symmetric bilinear forms Qt = Qj-^ = Qt(P\\) 
in stationary Gaussian fields X t , with covariances r[t — s), t, s G I and spectral density /(A) (note 
that = J teIj , Xtv(dt) is "too simple" for our purpose, since it is already Gaussian and and its k-th 

order cumulants Xk(S^) = 0, \/k ^ 2). The presentation follows [14] for the discrete case and [38] for 
the continuous case, except that we clarify the point that the previous results hold in any dimension 
d. 

To obtain the central limit theorem for T~ d ^ 2 Q^' 1 ^ by the method of cumulants it is enough to 
show that: 

/<# 1} \ fQ { T' 1} \ 

A direct computation based on multilinearity yields the cumulants of Q^^'- 

Xk = x(Qt, Qt) = 2 k - l (k - 1)! Tv[(T T (b)T T (f)) k }. (1.1) 
Here, Tr denotes the trace and 

T T (b) = (b(t-s), t, a € J T ), T T (f) = (r(t -s),t,se I T ) 

denote Toeplitz matrices (with multidimensional indices) of dimension T x T d in the discrete case 
and truncated Toeplitz-type operators in the continuous case 0. 



^Recall that in continuous case, the truncated Toeplitz-type operator generated by a function / £ -Loo, is defined for 
u G L2 as follows 

T T (f)u(t) = f f(t-s)u(s)v(ds). 
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While the cumulants in (jl.ip may be expressed using powers of two Toeplitz matrices, it turns out 
more convenient in fact to consider more general products with all terms potentially different (taking 
advantage thus of multi- linearity) . 

Suppose therefore given a set / e (A) : S — > R, e = 1, . . . ,n of "symbols" associated to the set of 
Toeplitz operators, where (S, dfj,) denotes either R rf with Lebesgue measure, or the torus [— ir, 7r] d with 
normalized Lebesgue measure. Assume the symbols satisfy integrability conditions 

f e eL Pe (S,dfi), l<p e <oo. (1.2) 

Let f e (k), k G I, be the Fourier transform of / e (A): 

f e (k) = [ e lkX f e (\)n(d\), k 6 /, 
Js 

where I = Z d in the torus case and / = R rf in the case S = R d , respectively. In this last case, we would 
also need to assume that f e € Li(M. d ,dfi), for the Fourier transform to be well defined. Consider the 
extension of our cumulants: 

n 

J T = TrfQWi)] 

i 

„ n 
= / hU2-jl)hti3-h)-Jn(jl-jn)llv(dj v ). (1.3) 

Jh 7«e/T v= i 



Replacing the sequences f e (t) by their Fourier representations f e (t) = f s f e (X)e ltx dX in (|1.3p yields 
the following alternative spectral integral representation for traces of products of Toeplitz matrices or 
truncated Toeplitz operator^: 

„ n n 

Jt= /i(Ai)/ 2 (A 2 ).../ n (A n )nA r (A e+1 -A e )[]MdA e )= (1.4) 

JXi,...,x n es e=1 e=1 

r. „ Ti—l n— 1 

/ (/ fi(\)f2{\ + u 1 )..J n (\ + J2u e )d\)<S> T (u 1 ,...,u n _ 1 )Y[fi(du e ), 

Jui,...,u n -i&S ^ J \eS i ' e=1 



where the index n + 1 is defined to be equal 1, and where 

72 — 1 71 — 1 



1 e=l 



which, after normalization with the factor — ^-\)a, Ti ■> yields the "multivariate Fejer kernel" 



Of course, the two expressions Jt, Jt are equal if f e £ Li, e = 1, n. Note however that the "spectral representation' 
(11.41) is well defined even without the last condition. 
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Note that the inner integral 

C(u) = C (/li ... Jn) (ui,...,n„_i) := 

r. 71— 1 

/ /i(A)/ 2 (A + ui).../„(A + yV)dA, (1.5) 
Jxes x 

to be called a graph convolution, is well defined precisely under the classical Holder conditions, when 
the integrability indicies in (1.2) satisfy: 

jEePe 1 <lwhenS = Z d 

!>d \ ' J 



J2 e Pe 1 = 1 when S 



The resulting integral 



. re— 1 

Jt= C(ui,...,v^)^ T {ui,... ) u n -i)Y[fj l (d 

Jui,...,u n -ies i 



e=l 

is our first example of a "Fejer graph integral" to be introduced in general in the next section. These 
are integrals involving products of Dirichlet kernels and functions, applied to linear combinations, 
which are related to the vertex-edge incidence structure of a certain directed graph (in the occurrence, 
the cyclic graph on the vertices {1, ...,n}). 

The second expression in the RHS of (jl.4p reveals the asymptotic behavior of Fejer graph integrals, 
since, as noted, when T — > oo, the multivariate Fejer kernel ^ convergences weakly to a S 
measure. 

The kernel lemma 10.11 will imply that 

T - d J T -» (2vr)( n - 1 ) d C(0,...,0) = / A(A)/ 2 (A).../ n (A)/i(dA) 

Jxes 

provided that we check that the function C(m, it n -i) is bounded and continuous. This is indeed 
true, as stated in the next result: 

Lemma 1.1. The "graph convolution" function C(u±, it n -i) defined in U.5\) is bounded and con- 
tinuous if holds with integrability indices satisfying the condition U.6\) . 

Proof: Note that the function C : S^™ 1 — > M is a composition 



of the functional 



defined by @ 



C(ui, ...,« n _i) - C(T 1 { Ul ,...,u n - 1 ),...,T n {ui,...,u n - 1 )) 

n 

e=l 
„ n 

C (h,.,U) ■= / II /e(A)//(dA) 

JAGS 



^Note again this is well denned precisely under the classical Holder conditions. 
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with the continuous functionals 

T e (m, ...,ii n _i) : S 12 " 1 -> L Pe 

defined by T e {u\, ...,u n -i) = / e (- + YT V =\ u v),e = 1, 

Indeed, the continuity of the functionals T e is clear when f e is a function which is continuous, 
bounded and of bounded support, and this continues to be true for functions f e 6 L Pe , since these can 
be approximated in the L Pe sense. In conclusion, under the Holder assumptions, the continuity of the 
functional C(u\, ...,u„_i) will follow automatically from that of C(/ lv . .,/„)• 

Finally, under the "Holder conditions" (jl,6p , the continuity as well as boundedness of the function 
C(fi,...,f E ) follow from its multi-linearity and from the Holder inequality: 

n 

l C (/l,...,/„)l < II WfeWp* ■ 
e=l 

Theorem 1.2. Let f e £ Lp e ,e = l,...,n where 1 < p e < oo,e = l,...,n, and let Jt,Jt be defined by 
/jl-4\ ), lll-3\) respectively. Then, it follows that: 

(1) If the integrability indices satisfy the Holder conditions 

J2ePe X <lwhenS = Z d 
Y.ePe 1 =lwhenS = R d , 

then 

lim T~ d J T 

(2) If a := YlePe 1 > 1) then it holds that 

J T = (T ad ). 

(3) If in the continuous case it holds in addition that if f e £ L Pe flLi,e = 1, ...,n, then Fourier 
coefficients / e (fc) may be defined, and the previous results hold also for Jt = Tr[n™ =1 ^r(/e)]- 

Notes: 1) In the second case, the exact exponent of magnitude is unknown, except for the upper 
bound a d. 

2) The result above is a refinement of a limit theorem of Grenander and Szego [46] concerning traces 
of products of truncated Toeplitz operators. Under the current strengthened integrability conditions, 
it was obtained when d = 1 in the discrete case in [14] and in the continuous case in [38]; as we 
show below, the result holds in fact in any dimension d (after modifying the necessary integrability 
condition for the asymptotic variance, in accordance to the Holder- Young-Brascamp-Lieb inequality). 

Following the proof of [12], we see that part 1 of Theorem 11.21 follows from Lemmas 10 . 1 U 1 . 11 ab ove . 
yielding the convergence of the normalized variance: 

lim ^ = J(0,...,0) = / h{\)f 2 (\)...f k (X)^d\). 
Part 2 (see the proof of Theorem 3.1(c) below) implies that for k > 3, the cumulants satisfy 



„ n 

Jxes fj{ 



hm ,, /n = 0, 
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and implies asymptotic normality. We arrive thus at the following multidimensional generalization of 
the results of Avram [10] and Ginovian [38]. 

Theorem 1.3. Consider the quadratic functional 

Qt = Q [ t 1] = [ [XtX, ~ ®X t X s ] b(t - s)v{ds)v{dt), 

where Xt, t G I is a Gaussian random field with spectral density /(A) G L p . Assume the generating 
function 6 of the quadratic functional is such that 6(A) G L ? , that: 

1 1 1 

p q 2 

and that in the continuous case we have also 6(A), /(A) G L\. 
Then, the central limit theorem holds: 

UmT~ d/2 Q T -» N(0,a 2 ), T —> oo, 

where 

a 2 := 2(2Tr) d f b 2 (X)f 2 {X)d\. (1.8) 
Js 

We present now one more result for Gaussian fields, which is related in the discrete case to the 
classical result of Breuer and Major [27], and in the continuous case to the result of Ivanov and 
Leonenko [53]. We note that these authors worked under time-domain assumptions, however, reasoning 
in the spectral domain with the methodology of [14] and the present paper - see Example 13.51 - 
immediately lead to the following result. 

Theorem 1.4. Let Xt, t £ I, be a Gaussian random field with spectral density /(A) G L p . Let 
St = J te j T Pl(Xt)v(d£), where Pi(Xt) are univariate Appell (Hermite) polynomials and I > 2. Assume 
that: 

z:=p -i<l- (1.9) 

Then, 

„ l-l 

CT 2. = / (*,0( ) = / f(yi)f(y2-yi)..-f(vi-x-Vi-2)f{-yi-i)]ldyi<oo. (1.10) 

•>Vl,—M-l£S i=1 

If, moreover, a 0, then the central limit theorem holds 

\i m T- d / 2 S T -> N(0,a 2 ). 

Note: The difference between the integral representations of the variances (|1.8p and (jl.lOp will be 
explained via graph theory in the next section. 

2. Fejer graph/matroid INTEGRALS AND graph/matroid CONVOLUTIONS 

In this section, we introduce a unifying graph-theoretical framework for problems similar to those 
of the previous section. 
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Definition 2.1. Let G = (V,£) denote a directed graph with V vertices, E edges, a basis of C 
independent cycles d and co(G) components. The incidence matrix M = {M v e }veV ee£ of the 
graph is the V x E matrix with entries [v, e] = ±1 if the vertex v is the end/start point of the edge e, 
and otherwise. 

A circuit matrix M* is a C x E matrix whose rows are obtained by assigning arbitrary orientations 
to a basis of circuits (cycles) c = 1, ...,C of the graph, and by writing each edge as a sum of ± the 
circuits it is included in, with the ± sign indicating a coincidence or opposition to the orientation of 
the cycle 

Besides the graph framework, we will hint also to possible matroid generalizations. To clarify this 
point, let us start by quoting Tutte: "it is probably true that any theorem about graphs expressible 
in terms of edges and circuits exemplifies a more general result about vector matroids" . 

Let us recall briefly that matroids are a concept which formalizes the properties of the "rank 
function" r{A) obtained by considering the rank of an arbitrary set of columns A in a given arbitrary 
matrix M. More precisely, a matroid is a pair £, r : 2 £ — > N of a set £ and a "rank like function" r(A) 
defined on the subsets of £. Matroids may also be defined in equivalent ways via their independent 
sets, via their bases (maximal independent sets), via their circuits (minimal dependent sets), via their 
spanning sets (sets containing a basis), or via their flats (sets which may not be augmented without 
increasing the rank). For precise definitions and for excellent expositions on graphs and matroids, 



The most familiar matroids, called vectorial matroids, are defined by the set £ of columns of a 
matrix and by the rank function r(A) which gives the rank of any set of columns A (matrices with 
the same rank function yield the same matroid). 

Some useful facts from matroid theory are the fact that to each matroid M one may associate a 
dual matroid, with rank function 



For vectorial matroids, the dual matroid is also vectorial, associated to any matrix whose rows span 
the space orthogonal to the rows of M. Furthermore, in the case of graphic matroids, the dual matroid 
is associated to the circuit matrix. 

Tutte's "conjecture" holds true in our case: a matroid Szego-type limit theorem was already given 
in [14]. However, for simplicity, we will restrict ourselves here to the particular case of graphic 
matroids associated to the incidence matrix M of an oriented graph. In this case, the proofs are 
more intuitive, due to the fact that the algebraic dependence structures translate into graph-theoretic 
concepts, like circuits corresponding to cycles, etc. 

From here on, we will restrict ourselves to the graphic case, i.e. to the case when our dependence 
matrix is the incidence matrix of a directed graph. 

Let (S, d/u,) denote either R d with Lebesgue measure, or the torus [— 7r, ir] d with normalized Lebesgue 
measure, and let / e (A) : S — ► R, e = 1, . . . , E denote a set of functions associated to the columns of 
M, which satisfy integrability conditions 



a basis of cycles is a set of cycles, none of which may be obtained via addition modulo 2 of other cycles, after ignoring 
the orientation 



see [61], [62] or [72]. 



r*(A) = \ A\ - r{M) + r(£ - A). 



f e 6 L Pe (S,dn), 1 < p e < oo. 
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Let f e (k), k G I, the Fourier transform of / e (A): 

f e (k) = [ e ikX f e (\)fi(d\), k G /, 



where I = % d in the torus case and I = R d in the case S = R , respectively. In this last case, we would 
also need to assume that f e G L\(M. d ,dfi), for the Fourier transform to be well defined. However, all 
our analytic results concern the spectral domain, and hence this assumption will not be necessary. 
Our object of interest, in its "time domain representation", is: 

Jt = J T (M,f e ,e = l,...,E) 

f V 
= / h{h)f2{i2)...fE{i E )J[v{dj v ), (2.2) 

J 71 1v€It v= i 



where i = (ii,...,^) = (ji, jy)M = jM, where v{dj v ) stands for Lebesgue measure and counting 
measure, respectively, and where in the torus case the linear combinations are computed modulo 
[—it, ir] d , so that the linear map jM :S V ^S E is well defined. 

Note. To keep the transparent analogy with the case, when d = 1, we make the following convention 
concerning notations. Here and in what follows let us treat a product of a vector, whose components 
are d-dimensional, and a matrix (or another vector) with scalar components in a specific sense: we 
will still perform multiplication component- wise according to the usual rule, and as a result we obtain 
a vector, whose components are d-dimensional again (or, correspondingly, just d-dimensional vector). 

A Fejer graph integral is the expression obtained by replacing the sequences f e {t) in (|2.2p by 
their Fourier representations f e (t) = f s f e (\)e ltX [i(d\): under the assumption f e G Li(S, d/i), an easy 
computation (see [12], Lemma 1) shows that (12. 2ft may be written also as the integral (12. 4j) below. We 
introduce however a more general concept. 

Definition 2.2. Let (S,d[i) denote either M d with Lebesgue mesure, or the torus [— vr,7r] d with nor- 
malized Lebesgue mesure. Let M be a matrix of dimensions V x E, with arbitrary coefficients in the 
first case and with integer coefficients in the second case. Let / e (A) : S — > R, e = 1, . . . , E denote a set 
of functions associated to the columns of M. Suppose these functions satisfy integrability conditions 

f e G L Pe (S,dfj,), 1 < p e < oo, (2.3) 

A Fejer matroid integral is defined by the following "spectral representation" : 

J T = J T (M,f e ,e = l,...,E) (2.4) 

V E 

/i(A 1 )/ 2 (A 2 ).../ £ (A £ ) \{ A T (u v ) n(d\ e ) 



v=l e=l 



where Aj(«) is a kernel defined by h0.1\) . where (u\, uy) 1 = M{\\, \e)', and where in the torus 
case the linear combinations are computed modulo [— vr, ir] d . 

A Fejer matroid integral will be called a Fejer graph integral for graphic matroids associated to 
the incidence matrix M of a directed graph G. In this case, the functions and kernels in \2.J$ are 
associated respectively to the edges and vertices of the graph. 
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The cycle graph/Toeplitz example Consider the particular case of a cyclic graph with n edges. 
In this case, the matrix M with n columns and rows, is: 



f -1 
1-10 



M 



1 \ 








1 



-1 





V o o 



-1 



-1 / 



and its Fejer graph integral is given by: 



Jt 



\\,...,\ n &S 



/i(Ai)/ 2 (A 2 ).../ n (A n ) J] A T (A„ - A„ + i) \\ fi(dX e ). 



v=l 



e=l 



Note. For analytical results concerning only Fejer matroid integrals as defined by (|2,4|) . the condi- 
tion f e G Li(S,dfJ,) is unnecessary. 

3. Limit theory for Fejer graph integrals 

The main points of the limit theory for Fejer graph integrals, to be presented now, are that: 
(1) Under certain Holder- Young-Brascamp-Lieb conditions necessary to ensure the existence of 
the limiting integral, the following convergence holds as T —* oo: 



c 



sc 



/i(Ai)/2(A 2 ).../ B (A B )JJ/i(dy c 



c=l 



(3.1) 



where (\i,...\e) = (yi, ■-, yc)M* (with every A e reduced modulo [— it, Tr] d in the torus case), 
M* being any matrix whose rows span the space orthogonal to the rows of M, and C being the 
rank of M*. Informally, the kernels disappear in the limit, giving rise to the "dual matroid" 
M*. 

(2) When the Holder- Young-Brascamp-Lieb conditions do not hold, then, cf. part c) of the theorem 
below, the normalization defined in part a) will lead to a zero limit. 

Theorem 3.1. Suppose that f e £ L Pe (dfi) for part a), and f e G L Pe (d^) for parts b),c) and set 
z= (pf 1 ,...,^ 1 ). 

Let Jt = Jr(M, fx, /jg) denote a Fejer matroid integral and let r(A),r*(A) denote respectively 
the ranks of a set of columns in M and in the dual matroid M* . 

Suppose that for every row I of the matrix M , one has r{M) = r{M{), where Mi is the matrix with 
the row I removed. Then: 

a) 



J T (M,fx,...,f E )<c M T 



d a M (z) 



(3.2) 



where cm is a constant independent of z and 



14 



TITLE WILL BE SET BY THE PUBLISHER 



I in the discrete case 0^(2) is given by (jg.ffp 
I in the continuous case cxm{z) = a c M (z) = co(M) + Q^ e z e — C). 
b) If o,m(z) = V — r(M) = co(M), or, equivalently, 



in the discrete case 
in the continuous case 



(3.3) 



then 



where 



c 

JlW.h lis)-- I fi(\i)f2(\2)...fE(\E)Hv(dy c 

S ° c=l 



(3.4) 
(3.5) 



and where (Xi,...Xe) = ■■■■>yc)M* (with every X e reduced modulo [— vr,7r] d in the discrete 
case), and C denotes the rank of the dual matroid M* . 
c) If a strict inequality o<m( z ) > co(M) holds, then the inequality h3.2\l of Theorem VJ.1\ a) may 
be strengthened to: 

J T (M) = o(T daM{z) ) 

Remark: The results of this theorem, that is, the expression of (xm(z) an d the limit integral J(M*) = 
J~(M*, /1, /#), as well as the convergence conditions of integrals depend on M, M* only via the 
two equivalent rank functions r(A),r*(A), i.e. only via the matroid dependence structure between the 
columns, and not on the chosen representing matrices. 

Proof: The proof of part b) of Theorem 13.11 is essentially identical with that given in [12], up 
to the modification of the integrability conditions and the appearance of the extra constant ku- For 
completeness, we sketch now this proof, for a connected graph (w.l.o.g.). 

Note first that in a connected graph there are only V — 1 independent rows of the incidence matrix 
M (or independent variables Uj), since the sum of all the rows is (equivalently, uy = — J2v=i u v)- 
Thus, r(M) = V — 1, co{M) = 1, and the order of magnitude appearing in the normalization is just 

rpd 

The main idea behind the proof of Theorem 13.11 b) are a change of variables and applying the 
continuity of graph convolutions: 

(1) Change of variables. Fix a basis yx,...,yc in the complement of the space generated by 
the u v 's, v = 1, . . . , V, switch to the variables u±, ...,uy-i,yi, yc and integrate in (|2.4p first 
over the variables y c 's, c = 1, . . . , C. This is more convenient in the graphic case, since, after 
fixing an arbitrary spanning tree T in the graph, the complementary set of edges T c furnishes 
a maximal set of independent cycles (with cardinality C). Assume w.l.o.g. that in the list 
(Ai, Xe), the edges in T c are listed first, namely (A e ,e G T c ) = (Ai, Ac). We make 
the change of variables y\ = Xi,...,yc = Ac, and (tfi, uy-i)' = M(X\, Xe)' , where M 
denotes the first V — 1 rows of the incidence matrix M. Thus, 



(yi, ...,yc,ui, ...,u V -i)' 



\M C 





M v 



(Ai, x E y 
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where the first rows are given by an identity matrix Iq completed by zeroes and where Mc, My 
denote the first C columns/ next V — 1 — C columns of the matrix M. 
Inverting the transformation above yields: 

(Ai,...,A B ) = {yi,...,y c ,ui,...,u V -i) jvf-i J = W 1 ' -' yc > Ul ' -' u V-i) y N J > ( 3 - 6 ) 

that is, it turns out that the first rows of the inverse matrix are precisely the dual matroid 
M*. 

Definition 3.2. The function 

c 

h M *,N(ui, ...,u r(M) ) = / /i(Ai)/ 2 (A 2 ).../i?(A £ ;) JJd/i(y c ) (3.7) 

J v\ vn&S i 



c=l 



where A e are represented as linear combinations of y\, ...,yo,U\, ...,u\r—i via the linear transfor- 
mation (|3.6j) will be called a matroid/graph convolution depending on whether the matroid 
is graphic or not. 

The change to the variables y\, yc, u±, uy-i and integration over yi, ...,yc transforms 
the Fejer graph integral into the following integral of the product of a "graph convolution" 
and a Fejer kernel: 

„ V V-l 

J T (M)= h M *,N(ui, ...,u V -i)Y\_A t (u v ) dn(u v ). 

Ju 1 ,...,u v _ 1 es v=l v=1 

Recalling that the Fejer kernel converges under appropriate conditions to Lebesgue measure 
on the set U\ = ... = uy-i = 0, we find, just as in the cycle case, that part b) of Theorem 
13.11 will be established once the convergence of the kernels and the continuity of the graph 
convolutions h(u\, u r i M \) in the variables (ui, u r i M \) is established. 

(2) 

Lemma 3.3. The continuity of graph convolutions. The "graph convolution" function 
C(u\, u n -\) defined in ( ti.5j) is bounded and continuous if lil.ty) holds with integrability in- 
dices satisfying the power counting condition (1.6). 

The proof is essentially the same as in the cycle case. Note that the function h : — ► R 
is a composition 

hM*,N(ui, ...,uv-i) = J(M*,Ti(/i), ...,T E (f E )) 
of the continuous functionals 

T e (ui, ...,uv-i) ■ M v_1 L Pe 

and of the functional 

E 

J(M*,f u ...,f E ): J]L Pe ^R. 



e=l 
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The functional T e is defined by T e (ux, ...,uy-i) = fe{- + Yliv u vN v ,e), where the N v>e are the 
components of the matrix N in f|3.6[) . The functionals T e are clearly continuous when f e 
is a continuous function, and this continues to be true for functions f e £ L Pe , since these 
can be approximated in the L Pe sense by continuous functions. Thus, under our assump- 
tions, the continuity of the functional Hm* n(ui, ...,uv-i) follows automatically from that of 
J(M*,h,...,f E ). 

Finally, under the "power counting conditions" (|3.3|) . the continuity of the function J(M* , fx, ■ 
follows from the Holder-Brascamp-Lieb-Barthe inequality: 



\J(M*,fx,...,f E )\<H\\f, 



e\\p e 

e=l 



(see Theorem IC.ip . 

In conclusion, the convergence of the Fejer kernels to a 5 measure implies the convergence 
of the scaled Fejer graph integral 

J T (MJ e ,e = l...,E) to J(M*,f e ,e = l...,E), 

establishing Part b) of the theorem. 
The proof of parts a), c) are postponed to section 3.2. 
Remarks: 1) In the spatial statistics papers ( [7], [9]), the continuity of the graph convolutions 
hM*,N{ui, uv-i) was assumed to hold, and indeed checking whether this assumption may be relaxed 
was one of the outstanding difficulties for the spatial extension. 

2) It is not difficult to extend this approach to the case of several components and then to the 
matroid setup. In the first case, one would need to choose independent cycle and vertex variables 
yx, —,y r (M*) an d ux, ■ UffM)} n °t e the block structure of the matrices, with each block corresponding 
to a graph component, use the fact that for graphs with several components, the rank of the graphic 
matroid is r(M) = V — co(G) and finally Euler's relation E — V = C — co(G), which ensures that 

E = (V — co{G)) + C = r(M) + r(M*). 

3) An important feature of the discrete case is that the limiting result (relation (|3.4p ) when f e are 
complex exponentials is straightforward, implying therefore immediately theorem 13.11 in this case, by 
the multilinearity of J(M* , fx, /e) and T~ d J(M, fx, /e) and by Lemma l3~il below. 

Lemma 3.4. For any matrix with r(M) = V — 1, any set of integers b = (b e ,e = 1, ...,E), and any 
functions / e (A e ) = e lXebe ,e = 1,...,E, theorem \3.1\ holds, i.e.: 



S G 



Proof: Aside from the constant ku, the stated RHS (limiting value) above is: 
f e i<x > b > J] ^dVc) = [ e l <y> M * b > J] n(dy c ) = 1 MH=0 = l beR{M) 

c=l c=l 



where R(M) denotes the subspace generated by the rows of M. 
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Now the LHS in (|3.4p . before scaling, is: 
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J<b,\> 



-i<sM,\> 



TV 



fi u(d Sv ) n fi(d\ e ) = / n v{ - ds ^ \ e i<b - sM ' x> n k^o) 

v=l e=l u=l e=l 

= / II zy (^) 1 b-sAf=o = u(s : sM = b,s £ I T ) 

v=l 



Now for a matrix with integer entries it holds that: 

E(T) := v(s :sM = b,se I T ) 

is either 

E(T)l =0 ttb£R(M), 
|~ T D ^(/i n feer(M)) if 6 € R(M), 

the second statement being tantamount to the definition of the Lebesgue measure. Thus, the result 
holds with ku = /i(ii fl ker(M)). See for more details [14]. 

Note also that £7(T) is an "Ehrhart quasi polynomial", whose next coefficients are related to other 
geometric characteristics of Ii, which should allow developing correction terms to Theorem 13.11 

Note: The lemma above may be interpreted as saying that the measures on (S) E given by the 
"multiple Fejer kernels" 

v-i v-i 
T- d A T (- A,) [J (A T (X v )u(d\ u )) 

v=l v=l 

converge weakly as T — > oo to the uniform measure on the subspace A = yM* (since the Fourier 
coefficients converge). 

3.1. The upper bound for the order of magnitude of Fejer matroid integrals 

We turn now to the "upper bound exponent" ajvf(^) for the order of magnitude of Fejer matroid 
integrals (useful when it is not precisely d). In the discrete case [14], the exponent o.m(z) of this upper 
bound turns out to be d times the solution of a graph optimization problem: 







= co(M) + A max^[^ Zj - r*{A)\ 


(3.8) 


or equivalently, 










olm{z) 


= A max s [ CO (M - A) - ^(1 - Zj )\ 


(3.9) 



where co{M — A) represents the number of remaining components, after the edges in A have been 
removed, and, for a general Fejer graph integral, we define 



co(M - A) = V — r(M — A). 



(3.10) 



Note that for connected graphs and under the power counting conditions X^eA 2 ? — r *(^)> this 
exponent reduces to d, as in Theorem 11.21 
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We will call the problem (|3.9p a graph breaking problem: find a set of edges whose removal 
maximizes the difference between the number of remaining components and YljeA^ ~ z j)' 
or, equivalently, the difference between Ylj^A z j ana - the dual rank r*(A). 

Note: In the following examples an important role is played by the "maximal breaking" A = M 
and "no breaking A = 0" sets A, which yield often the solution of the optimal breaking problem. 
It is useful to introduce therefore the lower bound: 



a c M (z) = max [co(M -A)- ^(1 - zj)] = max{ co(M), ^{z e — 1) + V} 

= max{co(M), co(M) + ^ z e - C} = co(M) + (J^ z e - C) + , (3.11) 



e 



where the equality before the last holds by Euler's relation C = {E — V)+ (C denotes the number of 
cycles, and, more generally, the rank of the dual matroid M*). 
Note that in the case of a cycle graph of size m, this reduces to 



«m0) = max{l,^z e }, 



e=l 



as stated in Theorem 11.21 and that the expression (|3.1ip turns out to yield the upper bound exponent 
in the continuous case. 

Note: Theorem 13.11 c) may be used for establishing convergence to of higher order cumulants, 
whenever these may be written as sums of Fejer graph integrals, by computing bounds of the form 

rpda k (z) 

for Fejer graph integrals intervening in the cumulants of order k. Since the typical CLT normalization 
is T d l 2 , it will suffice then establishing "cumulant inequalities" 



dak{z) < kd/2 ^ ak(z) < k/2 



where ctk(z) is the exponent appearing in the expansion of the k-th cumulant. In fact, this may be 
strengthened (cf. Theorem 13. ip to include z satisfying the equality ctk(z) = k/2, if k > 3. 

In conclusion, establishing normality is reduced to computing the functions ak(z),k > 3, i.e., to 
solving a sequence of graph breaking problems. 

Example 3.5. The general structure of the intervening graphs for the fc'th cumulant of sums St of 
the m'th Appell polynomial of a Gaussian sequence is provided by graphs belonging to the set T(m, k) 
of all connected graphs with no loops over k vertices, each of degree m (see [14]). Let z = p^ 1 denote 
the integrability exponent of the spectral density. 

The cumulant inequality corresponding to the "maximal breaking=MB" of the fc'th cumulant 
graph, which typically yields a facet of the power counting polytope (PCP) (at least for k = 2) is 
given in this example by: 

«*(*) = 2_&e ~l)+V= ~7t{z -l)+k<-A* — <l-Z 
^-^ 2 2 m 
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At the limiting point 1 — z = -jj-, the cumulant exponents are a.k(z) = |, ensuring negligibility for 
k > 3. 

The discrete and continuous case may be unified here by asking for integrability at z = 1 — -, since 
in the discrete case the extension to smaller values of z is trivial. 

Example 3.6. A similar analysis holds in the case of cumulants of quadratic forms in Appell poly- 
nomials P m ,n(Xt, X s ). Note that while the number of graphs intervening increases considerably, the 
number of extremal points of the PCP is just 4 - see Figure [2j 




Figure 1. The graphs appearing in the expansion of cumulants of qua- 
dratic forms. Here k=4, m=5, n=4. The figure displays only some of 
the k(m+n)/2=18 correlation edges. 



The graphs intervening are - see Figure Q]- all the graphs belonging to the set T(m,n,k) of all 
connected bipartite graphs with no loops whose vertex set consists of k pairs of vertices. The "left" 
vertex of each pair arises out of the first m terms : Xt x , ...,Xt m : in the diagram formula, and the 
"right" vertex of each pair arises out of the last n terms : X Sl , ...,X Sn : The edge set consists of: 

(1) k "kernel edges" pairing each left vertex with a right vertex. The kernel edges will contribute 
below terms involving the function b(X). 

(2) A set of "correlation edges", always connecting vertices in different rows, and contributing 
below terms involving the function /(A)). They are arranged such that each left vertex connects 
to m and each right vertex connects to n such edges, yielding a total of k(m + n)/2 correlation 
edges. 

Thus, the k "left vertices" are of degree m + 1, and the other k vertices are of degree n + 1. (The 
"costs 1 — Zi, 1 — Z2 mentioned in Figure Q] refer to (|3.8j> ) . 

The PCP domain in the discrete case (which is precisely the convergence domain of the integrals 
defining the limiting variance), is indicated below, when m < n, in terms of the integrability indices 
z = (z\,Z2) of / and b (i.e. / € L pi , b G L p2 ) . 
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(0,1/2) 



D 



m,n 



(l-l/m,l/2) 




X m z i + 2 z> = m 






\ 

\ 

\ 

1 

i 

\ 
\ 
\ 

i 



(l-l/n,m/(2n) 



z l 



A(l-l/(m+n), 0) 



Figure 2. The domain of the central limit theorem, discrete case 



When m < n, there are only three segments on the undominated boundary of the PCP, connecting 
respectively the extremal points (A,B), (B,C) and (C, D)(with coordinates A(l — l/(m + n),0), 
B(l— l/n,m/(2n)), C(l— 1/m, 1/2), D(0,l/2)), and correspond respectively to the breakings indicated 
below: 

total breaking 2k - _ Zl ) _ k(l - z 2 ) < § 

(zn+n) (1 _ Zi) + (1 _ Z2) 

breaking all z 2 and the left z\ edges maxfc — m(l — z\) — (1 — zi) < ^ 

4^ 1 < m(l - zi) + (1 - z 2 ) 
breaking all edges max^ | — (1 — z 2 ) =<\ 



In the continuous case, the domain is just the lower segment between the points A and B in figure 

13 



3.2. Proof of Theorem 13. 1\ parts a), c) 

We turn now to theorem 13,11 a) , c) , generalizing Theorem 2 and Corollary 1 of [14] . 
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For part a), let us apply the Holder- Young-Brascamp-Lieb inequality with optimally chosen inte- 
grability parameters s^ 1 : 



V E 



\J T (MJ l ,...,f E )\<KH\\A T \\ s - 1 ll\\f e \\ z -i (3.12) 

u=l e=l 
E 

< K'T d SLiCi-^ 1 ) Y[ \\f e \\ z -i (3.13) 

e=l 

under the constraint that (si, sy, z±, ze) satisfy the power counting conditions, and where we 
used the kernel estimate 

|A T (A)| 8 -i KC^-'^Vse [0,1) 

(see Appendix B). 

The optimization problem for s v in the discrete case: 

v 

min d / (1 — s" 1 ), where (si, sy, z\, ze) € PCP, 

D = l 

has the same constraints as Lemma 2 in [14], except that the objective is multiplied by d. Hence, in 
the torus case, the exponent is simply d times the one dimensional exponent of Theorem 1, [14]. 

In the continuous case S = M. d , we note first that when ^ e z e < C the result follows, just as Theorem 
13. li b) . from the Holder- Young-Brascamp-Lieb inequality, while in the other case ^ e z e > C, the extra 
constraint Yl v s v = E — Y2 e z e yields the one-dimensional exponent as om(z) = co(M) + Y2 e z e ~ ^ • 

For part c), we approximate our functions by continuous, bounded functions of bounded support, 
for which conditions (3.3) of part b) hold. For these approximants, it follows from the convergence 
ft%M) -» k M J{M*,h, ...,f E ) that J T {M) = (T dc °( M )+ a )), Va > 0. The result follows then for the 
functions from L p spaces, by the definition of these spaces. Note that in the discrete case, the same 
argument was applied based on trigonometric polynomials - see [14], proof of Corollary 1. 



4. Applications 

4.1. Central limit theorems for bilinear forms of moving averages 

We assume below that our stationary random field Xt, t £ I admits a representation as a lin- 
ear/moving averages random field. 
For discrete parameter it means that 

X t = «(* - u )^{u), d 2 {u) < oo, t G Z d , (4.1) 

where £(it), u G Z, d , are independent random variables indexed by Z rf with E^(0) = and such that 
E |£(0)| fc < Cfe < oo, k = 1, 2, . . . . In this case 



k 

c k (h, ...,t k ) = cum k {X tl , X tk ] = d k y \ \\ d(tj - s), 



(4.2) 
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where dk is the fc'th cumulant of £(0). 

For continuous parameter we assume that 



Ju£R d 



X t = I a(t-u)£{du), t£R d , (4.3) 



with a square-integrable kernel a(t), t G I, with respect to a independently scattered random measure 
with finite second moment, that is a homogeneous random measure £(A), A cR d , with finite second 
moments and independent values over disjoint sets (see, for instance, Rajput and Rosinski [65] or 
Kwapien and Woyczynski [55]). That is, for each Borel A, £(A) is an infinitely divisible random 
variable whose cumulant function can be written as 



K(Z 



\ogEe iz ^ A) = izmo(A) - \z 2 m x {A) + I (e izx - 1 - izr(x)) Q{A,dx), (4.4) 

2 Jr 



where ttiq is a signed measure, mi is a positive measure, Q(A,dx) (for fixed A) is a measure on R 1 
without atoms at 0, such that J R min jl, |x| 2 | Q(A, dx) < oo, and where t(x) = x if \x\ < 1, and 
t(x) = xj \x\ , if |x| > 1. 

For example, if I = R, then £,(A) is a set indexed Levy process with finite second moments and 
stationary intensity proportional to the Lebesgue measure. 

We also assume that Q factorizes as Q{A,dx) = M(A)W(dx), where M(A) is a a— finite measure, 
and W is some Levy measure on R , such that for some s > and A > 



/ e Xu W(du) < oo. 

J(-e,e) 



This implies that 



/ \u\ k W{du) < oo, k>2, 
Jr 



and that the cumulant function n{z) is analytical in a neighborhood of 0. 

Necessary and sufficient conditions of existence of the integral (as limit in probability of integrals 
of simple functions) 



L 



f(s)d£(s) 



can be found in [65]. Note that, for d = 1, an integrals become integrals with respect to Levy process 
L(t), teR 1 , and k(z) = logEe izL ^. 

For Lebesgue measures mo, mi and Q, one can prove (by using product integration) that 



log/: oxp{i(ziX tl H h z k X tk } = J k | ^2zjd(tj - s) J d* ( 1.5) 

Rd V =1 / 



if a € L\ n L2. From (|4.5p if can be seen that random field (|4.3p is homogeneous in a strict sense. 
We assume that m\ = 0, that is, E£(/i) = 0, then the last formula holds for d £ L2. 
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We obtain that 

c k (ti,...,t k ) = cum k {X tl ,...,X t ,} = d k IT a(tj - s)ds, (4.6) 

where d k is the k'th cumulant of £(ii) with I\ being the unit rectangle, that is d k = K k (0)/i k , k > 2. 

We assume from now on that E£(/i) = 0, and use the same notation for both discrete and continuous 
cases 

X t = f a(t-u)£(du), tel, (4.7) 
Juei 

where / = Z d in the discrete case and I = M. d in the continuous case. 

For various conditions which ensure that (|4.7p is well-defined, see, for example, Anh, Heyde and 
Leonenko [6], p. 733, and references therein. 

Note: By choosing an appropriate "Green function" d(t), this very general class of processes 
includes the solutions of many interesting differential equations with random noise £,(du), like, for 
example, generalized Ornstein-Uhlenbeck processes in M [6]. 

We will assume here that all moments for our stationary field Xt exist. 

The advantage of the linear representation assumption (14. 7D and (14. 4D . (|4.5f) is the explicit repre- 
sentation of cumulants - see for example Theorem 2.1 of [6]: 

Ck(h,...,t k ) = d k Y[a(tj - s) u(ds), (4.8) 
Jsei j=1 

where d k is the k'th cumulant of £(ii) with I\ being the unit rectangle. 
In the spectral domain, we get 

fc-i fc-i fc fc 

/ fc (Ai,...,Ajfc_i)=d fc a(-J^Ai) \{a{X i ) = J[a{X i )5C£ j X j ). (4.9) 

i=l i=l i=l j=l 

For k = 2, we will denote the spectral density by /(A) = /2(A) = d2a(X)a(— A). 

We can formulate now a central limit theorems for quadratic functional of a linear field, which 
is a generalization of the results of Giraitis and Surgailis [44] and Giraitis and Taqqu [42] (see also 
references therein) for discrete time processes. This next theorem follows from the results of Sections 
3 and 4 (the proof is almost identical to the proof of Theorem 4 of Avram [14], the expression for the 
variance can be obtained by direct computations). 

Theorem 4.1. Let Xt = J u( - I a{t — u)^{ du), t £ I, be a linear random field with a square integrable 
kernel d(t),t G /, and a random measure £(du) admitting all moments and let 

Qt = Qt 1] = [ [XtX, - EX t X s ] b(t - s)u{ds)v{dt). 
Jt,seiT 

We assume that /(A) = (2ir) d |a(A)| 2 G L p and 6(A) G L ? , and in the continuous case we assume also 
that 6(A) G L g fl L\. 
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Assume that: 

1 1 1 

p q 2 

Then, the central limit theorem holds: 

lim T~ d / 2 Qx -» ^(O,^ 2 ), T —> oo, 

where 

a 2 := 2{2ir) d d 2 2 [ b 2 {\)f 2 {\)d\ + {2it) d d A ( [ b(\)f(X)d\ 



JS \JS 
where dk is the k'th cumulant o/£(2i) with I\ being the unit rectangle, that is 

c?2 = E^(/i) 2 ,d4 = E(£(/i) 4 ) — 2[E(£(/i) 2 )] 2 in the continuous case and dk is the k'th cumulant of 
f (0) in the discrete case, that is d 2 = E£(0) 2 , d 4 = E(£(0) 4 ) - 2[E(£(0) 2 )] 2 . 

4.2. Minimum contrast estimation based on the Whittle contrast function 

The class of Whittle estimators is the most popular in applications (see Whittle [74] , [75] , Giraitis 
and Surgailis [41], Fox and Taqqu [33], Heyde and Gay [47], [48], Heyde [49], Gao, Anh and Heyde [37], 
Leonenko and Sakhno [57], see also the references therein). 

In what follows we will consider continuous time linear processes (d = 1) whose spectral densities 
of all orders exist and admit the representation of the form (|4.9|) . 

We begin with the following assumption. 

A.I. Let Xt, t G It = [— \, -j] , be an observation of a real- valued measurable stationary linear pro- 
cess X t , t G R 1 , with zero mean and the family of spectral densities (|4.9p . Let a (A) = a (A; , dk = 
d k (0 {2) ) , that is, / 2 (A) = f(\,9), A G M 1 , 9 = (6^,9^) , 9 G G C R m , where 9 is a compact set, and 
the true value of the parameter 9q G int@, the interior of 0. Suppose further that / (A; 9\) ^ / (A; 62) 
for 9\ ^92, almost everywhere in R 1 with respect to the Lebesgue measure. 

Consider the Whittle contrast process (or objective function) 

u ^ = lU >og,{x ~ J) + 7T&)) w{x)dx - <410) 

where It (A) is the periodogram of the second order 

1 



It (A) 



2vrT 



X t e~ itx dt 



AGR 1 , (4.11) 



and w (A) is a symmetric about A = function such that all considered integrals are well defined and 
which will satisfy some conditions given below; in some cases we can choose w (A) = j^z- 
Introduce the Whittle contrast function 

To state the result on consistency of the minimum contrast estimator based on the contrast process 
(|4.10p we will need the following conditions on the spectral density / (A; 9) and the weight function 
w(X). 

A.II. / (A; O ) w (A) G Li (R 1 ) n L 2 (R 1 ) , V0 G 0. 
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A.III. There exists a function v (A) , A G R , such that 

(i) the function h (A; 9) = v (A) f^x-e) ^ s uniformly continuous in IR 1 x 0; 

(ii) f(\-e )^eL 1 (R 1 )^ (k 1 ). 

Theorem 4.2. Let i/ie assumptions A.I to A.III be satisfied. Then the function K (9q;9) defined by 
is the contrast function for the contrast process Ut (9) defined by \4.10Q . The minimum contrast 
estimator 9t defined as 

9 T = argminC/ r (6») (4.13) 
flee 

is a consistent estimator of the parameter 9, that is, 9t — ► #o * n P^-probability as T —* oo. 

The above theorem can be obtained as a consequence of a more general result by Leonenko and 
Sakhno [57] (Theorem 3), one needs just to rewrite for the case of linear processes the corresponding 
conditions on spectral densities, which become of much simpler form. 

Next set of assumptions (in addition to the above ones) is needed to state the result on asymptotic 
normality of the estimator f|4. 13|) . 

A. IV. The function f(x-fi) ^ s twice differentiable in a neighborhood of the point 9q and 

(i) / (A; 9 ) w (A) M&j^^ E L x (M 1 ) n L 2 (M 1 ) , i,j = 1, m, 9 € 9; 

(ii) / (A; 9 ) E L p (M 1 ) , w (A) Jt^j E L, (R 1 ) , 

for some p, q such that | + | < |, i = 1, m, fls9; 

(iii) TV2 ; r1 ( SJt (A) - / (A; 0„))«, (A) dA ^ as T ^ oo, 
for all i = l, m, £ 9; 

(iv) the second order derivatives d ® de /(A-e) ' ^ = ^' •••' m ' are continuous in 9. 

A.V. The matrices Wi (9) = (w$ (9)) , W 2 (9) = (w® (9)) ,V(9) = ( Vij (9)). j=1 m 

V / i,j=l,...,m \ / i,j=l,...,m " 

are positive definite, where 

• ! W = T~~ / w (A) ^- log / (A; 0) A log / (A; 0) dA, (4.14) 
4vr 7 R i 96»i 90,- 



»{? W = ^ I - 2 (A) I: ^ / (A; 9) A log ; (A; ,) dA . (4 . 15) 

^■(9) = r j w(X)—\ogf(X;9)dX w (A) — — log / (A; 6*) dA. (4.16) 

Theorem 4.3. Let i/ie assumptions A.I to A. V be satisfied. Then as T — > oo 

T i/2 ^ _ ^ » ^ (o, VFf 1 (fo) (W 2 (6» ) + V {9 Q ))W{ 1 (0o)) , 

where N m (-,-) denotes the m-dimensional Gaussian law. 

Reasonings for the proof of Theorems 14.21 14.31 are given in the next section. 

Comparing the above theorem with a more general result stated in [57], one can see that the set 
of conditions for the case of linear processes becomes of much simpler form, but the most important 
improvement is in condition A.IV(ii), which was achieved due to the application of the Theorem 14.11 
(see the proof). Note that corresponding condition for the case of general processes, formulated in [57], 
unfortunately, is difficult to check in general situation. 
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Remark 4.4. Condition A.IV(iii) will hold, e.g., if / (A; 9) is differentiable with respect to A and 

or under any conditions which assure 

|/ (A + h; O ) - f (A; 6 Q ) \w (A) A_J_ dA < C\h\ a , 

for a > | and C being a constant. 

Example. Estimation of fractional Riesz-Bessel motion (FRBM) (see Appendix A for details and 
definition of FRBM in non-Gaussian case). Let At, t £ I 1 , be a non-Gaussian Riesz-Bessel stationary 
motion, that is a stationary linear process with the spectral density of the form 

/(A)=/(A ' 6,)= i A |^ ( r+A^ > agm1 ' (4it) 

where the unknown vector parameter 6 = (7, a, c) 6 6, 6 being a compact subset of [|, 00) x (0, |) x 
(0, 00) . Note that the index a determines the long-range dependence of FRBM, and the parameter 7 
is another fractal index connected to Hausdorff dimension of paths of the stochastic process. Note that 
procedure of discretazation leads to the loss of information of one parameter 7, which is important 
for applications in both turbulence and finance theory. That is why a direct method of estimation of 
both parameters from continuous data looks appropriate. 

For this model we can choose the weight function w (A) = 1 _ ) 1 A2 , A € R 1 , to satisfy the conditions 
needed for consistency of the estimator (I4.13p . that is, for Theorem 14. 21 to hold. However, to satisfy all 
the conditions needed for Theorem 14.31 we choose the weight function w (A) = jj^pyi , A 6 K 1 , where 
a and b satisfy the restrictions:-^ > 1} A {a > b + 2} A {a > A + 2},where we have denoted by A the 
length of the finite interval carrying the admissible values of the parameter 7. With such a choice of 
the weight function we have the convergence 

T i/2 ^ _ 0o j " N3 ( 0j W -i (0 Q ) {W2 (0 Q ) + v (p ))Wr 1 (Oo)) as T -» 00, 
where the elements of the matrices W\ and W2 are of the following form: 



w. 



(1V2) 
11 

(1V2) 
22 



-L / u; lv2 (A)(ln(l + A 2 )) 2 dA; 

r / ^(AMlntA 2 )) 2 ^; 
4vr J R i 



4s V2) = 4^0 2 / - 1V2 (AW/A: 
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wT ] = 4! V2) = ^ / ri ^ (A) In (1 + A 2 ) In (A 2 ) dX; 
4a V2) = ™T ] =-W! - 1V2 (A)ln(l + A 2 )dA; 



(1V2) _ (1V2) 



= j^ 2 (X)ln(X 2 )dX; 



Vu 



V22 



[ w (A) In (l + A 2 ) dx] ; 

^|(/^ (A)ln(A ^ A ) 2; 
■o 2 ([ w (A) 1 



^33 = f w(X)dX 



v 12 = V2i = ^-%l w{X)\n(l + X 2 )dX I w (A) In (A 2 ) dX; 
8vr d\ J R i J R i 

1 d f f 

V13 = v 31 = - — 32 c o 1 / w(X)ln(l + X 2 ) dX w (A) dX; 

V23 = .32 = -^-^Cq 1 / it) (A) In (A 2 ) dX [ w(X)dX. 
8-Kdi J R i J R i 



(i) 



In the above formulae we mean that the weight function w (A) is involved to the expressions for w\- 

(2) 

in the 1st power and to the expressions for in the 2d power. From the above formulae we see 
that the covariance matrix of the limiting normal law has the charming feature that it appears not 
depending on the values «o an d 70 • 

Remark 4.5. Continuous version of Gauss- Whittle objective function with the weight function w(X) = 
had been used in [37] for the estimation of the Gaussian processes in stationary and nonstationary 
cases respectively. 



4.3. Minimum contrast estimation based on the Ibragimov contrast function 

We consider now the minimum contrast functional motivated by the paper of Ibragimov [52], see 
also Anh, Leonenko and Sakhno [7]. 

We assume condition A.I and introduce the following condition 
B. I. There exists a nonnegative function w (A) , A £ R, such that 

(i) w (A) is symmetric about A = : w (A) = w (—A) ; 

(ii) w (A) / (A; 6) is in L x (R) for V# £ 0. 
Under the condition B.I, we set 



a 2 (9)= [ f(X;9)w(X)dX 

JR 
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and consider the factorization of the spectral density 

f(X;9) = a 2 {9)i;(X;9), A G R, G 0. 
For the function ip (A, 9) , A G R, 9 G G, we have 



/ 



^ (A; 0) io (A) dX = 1 



and we additionally suppose 

B. II. The derivatives Voip (A; 9) exist and 



V e / i/> (A; 9) w (A) dA = / V e V (A; 0) w (A) dA = 

that is we can differentiate under the integral sign in the above integral. 
Consider the following contrast process (or objective function): 



U T (9) = - It (A) w (A) log V> (A; 9)dX, 9 G G. (4.18) 
Jr 

Define also the function 

K (fl ; 0) = [ f (A; 9 ) w (A) log % (A; * o) dA, #o> # G G. (4.19) 
7k ^ (A; 0) 

B. III. / (A; O ) u; (A) log ip (A; 0) G Lj (R 1 ) n L 2 (R 1 ) , V0 G Q. 
B. IV. There exists a function u (A) , A G R , such that 

(i) the function h (A; 9) = v (A) log ip (A; 0) is uniformly continuous in R 1 x G; 

(ii) / (A; Li (R x )nL 2 (R 1 ) . 

Theorem 4.6. Let conditions AI, B.I - B.IV be satisfied. Then the function K(9q;9) defined by 
{4-19 ) is the contrast function for the contrast process Ut (9) defined by fl^. 18\ ). Moreover the minimum 



contrast estimator 9t defined as 

9 T = argminC/r (9) , (4.20) 
eee 

is a consistent estimator of the parameter 9, that is, 9t — > 9q in P^-probability as T —* oo, and the 
estimator 

a T = I I T (X)w(X)dX 



is a consistent estimator of the parameter a 2 (9) , that is, a™ — > a 2 (9q) in Pq -probability as T — > oo. 

To formulate the result on the asymptotic distribution of the minimum contrast estimator (I4,20p 
we need some further conditions. 

B. V. The function ip (A; 9) is twice differentiable in a neighborhood of the point 9q and 

(i) / (A; 9) w (A) ggg- log $ (A, 9) G Li (R) n L 2 (R) ,i,j = 1, m, G G; 

(ii) / (A; O ) G L p (R 1 ) , w (A) ^ log V (A, 0) G L q (R 1 ) , 
for some p, q such that - + ~ < 5, i = 1, m, 9 G Q; 
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(iii) T 1 / 2 J Kl (EI T (X)-f(X;9 ))w(X) J-log^(A;0) dX -> as T ^ oo, 
for all i = 1, m, 6 0; 

(iv) the second order derivatives qq.qq . log V> (A; 0) ,£ = 1, m, are continuous in 0. 

B. VI. The matrices S (9) = (s{j (9)) i J=1 m and A (9) = (a^- (9)) i - =l m are positive definite where 



(9) = Jj(X;9)w(X)^ f logiP(X;9)dX 



d 2 



d9id9j 



dX, 



= a 2 (0) [ w(X) 

an (9) = 4vr / f 2 (A; 9) w 2 (A) log ifi (A; 0) log ^> (A; 0) dA 

d 4 /■ w(A)/(A;fl) y w(X)f(X;9) d 

+27T l2 / T7V~a\ — oFV A e ^ / , . — 7^- i>{X;9)dX 

d\ J R ip (A; (9) 00< 7 R ^ (A; (9) 89 j 

= 4vr (a 2 (0)) 2 jf ™ 2 (A) ^ (A; 0) ( A; dA 

+2vr§ (a 2 (0)) 2 / «,(A)^(A;(9)dA / w (X) ^ (X; 9) dX 



Theorem 4.7. Let i/ie conditions AI, B.I - B.VI be satisfied. Then as T 

v 



DC 



T i/2 # T _ 9o \v Njn (o, 5- 1 (0 O ) ^ (0 O ) 5- 1 (0 O )) 



where N m (•, •) denotes the m- dimensional Gaussian law. 
Proofs of Theorems EOJ, EDS, 331 13TTI 

The results on consistency of estimators (Theorems 14.21 and 14.61 ) are consequences of corresponding 
theorems stated for the general case in [57] for the Whittle functional and in [7], [8] for the case of 
Ibragimov functional. We present here reasonings for the proofs of Theorems 14.31 and 14.71 which make 
use of CLT for bilinear forms (Theorem 14. 1 1 above) . For the proofs the standard arguments based on 

Taylor's formula for \7qUt ( 0t) are used. Namely, we can write the relation 



V e U T [Gt) = V e U T (9 ) + V e V' e U T (9* T ) (9 T - O ) , 



where \0 



T 



< 9 T 



It follows from the definition of minimum contrast estimators that for sufficiently large T 



-V V g U T (050 [Gt - 9 



V e U T 



therefore, to state the asymptotic normality for the estimator 0? , by Slutsky's arguments, one needs 
to deduce: (1) limit in probability for V q*7'qUt (0|O and (2) limiting normal law for T l l 2 V ' qUt (0q) ■ 

For the 1-st task we can use the same arguments as in the mentioned above papers, and to rewrite 
(simplify) corresponding conditions for the case of linear processes. 
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However, for the step (2) we can appeal now to Theorem 14. 11 We provide the details below. 
Consider firstly the case of Whittle functional. Limit in i-fa-probability for VqVqUt (0j>) is given by 
the matrix W\ (0$) . 
Next, consider 



+ V 6 



1 



which can be written in the form 

V e U T (0 o ) = (Mvi)-J(Vi))i=i,..., m 

fi (A; 0q) It (A) dX - [ ^ (A; ) f (A; O ) dX 



I T (A) w (A) dX, 

do J 



i=l,...,m 

where 

<Pi = ifi (A; <9 ) = - ]- 2 1 w (A) l-^-f (A; 9) J , i = l,...,m. 
4TTf 2 {X;0 o ) \d0i J e=6o 

Under the assumptions of Theorem 14.31 (see A.IV(ii)) in view of Theorem 14. II we have the convergence 

T 1 ' 2 (J T (ipi) - EJ T (ipi)) l=1 _ m 3 N m (0, W 2 (0 O ) + V (0 O )) , (4.21) 

where the the matrices W 2 (0o) and V (9q) are defined in the assumption A.V. 
Further, in view of the assumption A.IV(iii) 

T 1 / 2 (EJ T {w) - J(<pi)) 0, as T -» 00 
which, combined with (|4.2ip . implies 

T 1 / 2 (J T (^) _ J(^))i=i,..., m = T^VgUr (0 O ) ° N m (0, W 2 (0 O ) + V (0 O )) . 

The case of Ibragimov functional is treated analogously. We have that V bVqUt (0 T ) converges in 
Po-P r obability to the matrix S (0o) ■ Further, 

V e U T (0 o ) = - I I T (X)V e log ^(X;e)\ e=6o w(X)dX. 

JR 

In view of B.II 

/ / (A; O ) V e log V (A; 9)\ e=0o w (A) dX = 0, 

JR 

and we can write 

v e u T (0o) = {Mn)-J^i))i=x,..., m 

ipi (A; 0q) It (A) dX - I w (X; Q ) f {X; O ) dX 
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where now 

d 

tpi = cpi (A; O ) = w (A) — log if) (A; 0) 



, i = 1, m. 

'o 

Again in view of Theorem I4.lt under the assumption B.V(ii), we obtain the convergence 



T 1 ' 2 (J(fpi) - EJ T (<pi)) Z N m (0, A (0 O )) , as T -> oo, 

where the matrix A (0o) is defined in B.VI. By assumption B.V(iii) the convergence T 1 ^ 2 'VqUt (0o) —* 
N m (0,A(9 Q )) follows. 

Appendix A. Fractional Riesz-Bessel Motion 

In this Appendix we mainly review a number results discussed in Gay and Heyde [36], Anh, Angulo 
and Ruiz-Medina [4], Anh, Leonenko and Mc Vinish [5], Anh and Leonenko [2], [3], Kelbert, Leonenko 
and Ruiz-Medina [54] (see also references therein). Also we introduce a not necessarily Gaussian 
Riesz-Bessel stationary process and formulate the central limit theorem for such a processes as well 
as for quadratic forms of such a processes. 

The fractional operators are natural mathematical objects to describe the singular phenomena of 
random fields such as long range dependence or /and intermittency. 

In particular Gay and Heyde [36] introduced a class of random fields as solutions of fractional 
Helmholtz equation driven by white noise, contained the fractional operator (cl — A)"/ 2 , c > (and 
its limit as c —* (—A)"' 2 ), where A is the (i-dimensional Laplacian and I is the identity operator (see 
also [54] for properties of such fields and possible generalization). Anh, Angulo, Ruiz-Medina [4] (see 
also [5], [2], [3] and references therein) generalized the fractional stochastic equation of Gay and Heyde 
in which the fractional Helmgoltz operator (cl— A) a / 2 , c > or the (i-dimensional Laplacian (c — ► 0) is 
replaced by a fractional Laplace-type operator of the form — (I — A) 77 " 2 (—A)"/ 2 , a > 0, 7 > 0, where 
the operators — (/ — A) 7 / 2 , 7 > 0, and (—A)"/ 2 , a > 0, are interpreted as inverses to the Bessel 
and Riesz potentials (see [68], pp. 134-138), that is integral operators, whose kernels have a Fourier 

transforms (2vr)- d / 2 (l + ||A|| 2 ) , AG R d , and (2vr)~ d / 2 ||A|p Q , A G R d , respectively. Then there 

exists a generalized random field C( x )> x G on fractional Sobolev space, which is defined by the 
equation 

(I - A) 7/2 (-A) Q / 2 C(x) = e(x), x G R d , (A.l) 

where {e(x), x G is a Gaussian white noise or equivalently (in the sense of second-order moments) 
there exists a random field with the spectral density 

/(A) = —t 1 A G R d , c > 0. (A.2) 

||A|| 2a (1 + IIAII 2 



These random fields were named the fractional Riesz-Bessel motion. 

For the random fields with stationary increments we assume a G (£, | + l) , 7 > 0. In particular, for 
d = 1, there exists a Gaussian stochastic process with stationary increments and the spectral density 
(|A.2p . where a G Q, | ) , 7 > 0. This fractional Riesz-Bessel motion (FRBM) is a generalization of 
the fractional Brownian motion (FBM) (see, for instance, Samorodnitsky and Taqqu [66]). FBM is a 
limiting case of the Riesz-Bessel (non-stationary) motion with 7 = (in terms of the Hurst parameter 



32 TITLE WILL BE SET BY THE PUBLISHER 

H E (0,1), the spectral density of the FBM with long-range dependence (H E (^,0)) is equal to 
— -ttw+t)- The FRBM is not self-similar (unless when 7 = 0), but it is locally self-similar. 

For d > 1 the presence of the Bessel operator is essential for a study of homogeneous (and isotropic) 
solutions of (jA.ip with spectral density (|A.2[) . which requires < a < £, a + 7 > |; that is the 
condition 7 > is necessary for /(A) E Li(R d ). Thus the homogeneous isotropic FRBM can be 
defined as a Gaussian random field with zero mean and covariance function of the form 

B a Jx) = [ e i<x ' x> ^d\, x E R d , (A.3) 

J iixii 2q fi + ||A|i 2 1 

where < a < i, a + 7 > |. Note that for a = 0, the covariance structure (|A.3|) belongs the Matern 
class, that is with 

m . 

S <vr(*) = 2- d 7T- ,x x E M d , 7 > ~, (A.4) 

2— r(^)i|x||— 2 



2 

where 



3C 



K u {z) = ^J s^expl—fs + ^jzlds, z > 0, 1/ E R, 


is the modified Bessel function of the third kind of order v or Mc Donald's function. Note that 

K v {z) = K_ v {z), K v {z) ~ T{u)2 v - l z- u , 

for > as z — > 0, 

km = 

^ ' V 2 

Thus, we have B aj7 (0) = 1. 

Note that for d = 1, a = 0, 7=1, the covariance structure (|A.4|) becomes f?o,i(aO = e~ x , x > 
0, that is stationary Gaussian Riesz-Bessel motion is identical to the Gaussian Ornstein-Uhlenbeck 
process. 

Remark A.l. These results can be generalized to the case when the above fractional operator is 
replaced by the operator 

a/3 

= _ + /,(/- A^-A)^ 2 , 0<(3<2, a>0, 7 > 0, 

where jjjp is the regularized fractional derivative. In particular, the Green function of the fractional 
heat equation: Hu(t, x) = 0, t > 0, x E M d , can be given as inverse Fourier transform of the function 

E/3 (-filfi \\Xf (1 + ||A|| 2 ) 7 / 2 ) , t > 0, A E m d , 
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where 



OO i 



is the Mittag-Leffler function (see [2], [3] for details and references). 

In order to introduce a Riesz-Bessel motion driven by Levy noise, we restrict our attention to the 
stationary case and d = 1 (replacing the space parameter x into t). For the function 

fc 

a (A) = tttt X rrr - , A G M, a + 7 > 1, a > 0, 

we introduce the function 

a(t) _ / e«VA )(i A - { J^"^" + « 2 0, (JU) 



where the confluent hypergeometric function 

(<£» 

(6)n n! 



^(a, 6; *) = ]T ^, (c) n = c(c + 1) • • • (c + n - 1), (c) = 1. 



n=0 

The Riesz-Bessel motion driven by Levy noise can be defined as the linear process 

Xt = [ d(t - s)dC(s), (A.6) 

where £(t), t G i? is a Levy process with cumulant function 

k(z) = log-Eexp {zz£;(l)} , 

such that «w(0) / 0, k > 2, and a(.) is defined by (|A.5|> . The fc-th order spectral densities of the 
Riesz-Bessel motion driven by Levy noise (|A.6|) take the form: 

/ fc (Ai, . . . , A fc _i) = (2vr)- fc+1 i- fc K ( fc )(0)a(A 1 ) • • • a(X k ^ 1 )a(X 1 + • • • + A fc _i), (A.7) 
which reduces to the second-order spectral density 

/ 2 (A) = —= , (A.8) 

; |A| 2a (l + A 2 )^ V ; 



AC( 2 )(0) 



< a < -, a + 7 > -, A G R 1 . 



2tt 2 '2 



For the Gaussian case, of course, K^ k \0) = 0, k > 3. 

Note that for a = 0, 7 = 1 we arrive to the Ornstein-Uhlenbeck process driven by Levy noise ( [6]). 
As a consequence of the Theorems of Sections 4 and 5 we obtain the following result for the linear 
process (|A.6|) . (Cf. also with Theorem 4.1) 
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Theorem A. 2. Consider the Riesz-Bessel stationary motion iA.6\) and assume that all cumulants of 
Levy process are finite. Let 



i-T/2 r-T/2 pT/2 

S T = / X s ds, Qt= / b(t - s) [X t X s - EX t X s ] dtds, 

J-T/2 J-T/2 J-T/2 



then: 

i) if 

a + 7 > -, a < 0, 

then the central limit theorem holds: 

T- 1/2 S T -> N(0, a 2 ),T -» do, 

where 

a 2 = K^(0); 

ii) if for some p > 1, q > 1, such that - + | < we have 

6(A) G L 9 ,a + 7 > a < 

2p 2p 

t/ten £/te central limit theorem holds: 

T- l ' 2 Q T 7V(0,<7 2 ),T oo, 

w/iere 

<t 2 = 2k( 2 )(0) / 6 2 (A)^ dAW 4) (0)-^ / 6(A)— 5 -dX. 

K ' h |A| 4a (l + A 2 ) 2 7 1 ; (2vr) 2 7 M V ; |A| (1 + A 2 ) 7 

Appendix B. Kernel estimates 
Consider the Dirichlet type kernel 

A T (A)= / e itx u{dt). 

When d = 1 and ii = [—1/2,1/2] (ly = JiT), one gets the clasical discrete/continuous time 
Dirichlet kernels: 

A m sin((r + l)A/2) f T / 2 itA sin(rA/2) 

At A) = > e = — —— , A r A = / e dt = — — , 

^rl sm(A/2) J-t/2 A/ 2 



-T/2 



respectively. For general d and It = [— T/2, T/2] d , putting A = (Ai,..,A<f), it follows that Ay(A) 
II" ,A/iA ; i. 
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Note that in the continuous case, by scaling, one finds 



|A T (A)| P = T 1 ~ 1 / p C p , AgR, l<p<oo, 
A T {\)\ p = T d l 1 - 1 Mc%, A G M. d , l<p<oo. 



(B.l) 



(B.2) 




In the discrete case, similar estimates may be obtained by using the inequality 




We find then: 



A T (A)|| P <T< 1 - 1 ^ &v 



Ar(A)|| p < T f 




, A G [ — 7T, 7r), 1 < p < OO, 



A = (Ai, .., Ad) G [-it, n) d , Kp<oo. 



\R / 

In the case of the Euclidean ball It = Bt = {t G R d : \\t\\ < T/2}, we find again by scaling in the 
continuous case 



where J v (z) is the Bessel function of the first kind and order v. It is known that J v (z) < const j^fz 
for a large z, thus for the ball 



where |s(l)| is the surface area of the unit ball in M. d , d > 2. 

Similar estimates may be obtained for the L p (1 < p < oo) norms of the discrete Dirichlet kernel. 

Note: These results are particular cases of the so-called Hardy-Littlewood Theorem (see, for in- 
stance, Zigmund [76], V. II, XII, §6), which can be formulated as follows: 

Theorem B.l. Let a n > a n+ i > ■■■ and a n — > 0. Consider the series 







(B.3) 



n=l 



and 



OG 




(B.4) 



n=l 
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and define by /(A) and g(X) the sums of the series HB.3\) and (B.4) respectively at the points where 
the series converge. A necessary and sufficient condition that the function f (or g) belongs to L p , 
1 < p < 00, is the following 



p-2 



< 00. 



n=l 



Moreover, 



Clearly, 1 = 1 = - 
for Dirichlet kernels: 



n=l 



1 > 



is nonincreasing, and we arrive thus to the following estimate 



t=i 



>i=l 



and 



where the error term 



T-1 

E' 

t=o 



d\ = TP- 1 - 



7T 



/•oo 


sinn 


Jo 


u 



du + R p {T), 



O p {TP-% p>3 
R P (T) = { O(logT), p = 3 

O p (l), Kp<3, 

where O p means that constants depend on p.(See, e.g., [1].) 
Note that for p = 1 



dAx log T. 

Appendix C. The homogeneous Holder- Young-Brascamp-Lieb inequality 

Subtle modifications of the conditions of the Holder inequality must be made when the arguments 
of the functions involved are restricted to some subspaces [35]. Starting with Brascamp and Lieb [26] 
and Lieb [56] (who considered only the case S = R), and following with Ball [17], Barthe [18] and 
Carlen, Loss and Lieb [29], this generalization of the classical inequalities of Holder and Young seems 
to have attained now its definite form in the work of Bennett, Carbery, Christ and Tao [20], [19]. 

We review now a particular case of this result. 

Let 

x = (x\, . . . , x m ) £ S 
where S may be either the multidimensional torus, integers or reals 




'[-71,71] 

S = < Z d 
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endowed with the respective normalized Haar measure fi(dx). 
When d = 1, the convergence of integrals of the form: 



fj,(dxi)...fi(dx m ) 
, e5 m h(x) z K..l k (x) z >> 



where ...,1k) are linear transformations 



L : S r - 



> S, lj(x) =< a>j, x >, j = l,...,k 



and where in the first two cases a,j are supposed to have integer coefficients, is a fundamental question 
arising in many applications. 

Let M denote the matrix with columns ctj,j = 1, ...,k. It was for long known to physicists that, 
when M is fixed, convergence holds for z = (zi, z/,) belonging to a certain "power counting polytope" 
PCP (these are relatively similar in all the three cases - see Theorem [Cj] below). 

It was first noticed in [12] and [13], in the easier case of unimodal matrices M, that under the 
same "power counting conditions" on Zj = pj , a Holder-type inequality 



(GH) 



iifMWMdx) < J? nil/,- 



I Pi 



holds, with the powers in (??) being replaced by arbitrary functions satisfying integrability conditions 
fj G L Pj ,j = l,...,k, and with K = 1. Note that Brascamp and Lieb [26] had already studied the 
analog harder inequality for general matrices (in the case S = but without pinpointing exactly 
the polytope; this was done later by Barthe [18]. 
For an example, consider the integral 



J= I I fl(xi)f2(x 2 )f3(x 1 +X2)h(x 1 -X 2 )dx 1 dX2 

>s Js 



where S 



Here m = 2, k = 4 and the matrix 

M : 



10 11 
11-1 

has rank r(M) = 2. The theorem below will ensure that 

\A < ll/l|ll/«i ll/alll/sa ll/3||l/z 3 ll/4||l/«4 
for any z = (z\, z 2 , Z3, 24) G [0, l] 4 satisfying z\ + z 2 + Z3 + Z4 = 2, e.g. if z = (0, 1, 1/2, 1/2), then 

|J| < (^pJ/iC^l) (f* \M x )\ dx ) (£ f * ix)dx ) ' (£ f ^ x)dx ) ' • 

It is easy to check (and true in general) that the extremal points of the PCP have only and 1 
coordinates (which may be exploited for establishing the result). Note also that the matrix M in 
this example is not unimodal; as a consequence, the optimal constant K = K(z) is not 1 at all the 
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extremal points, the exception being (0,0, 1, 1), where it is 2 _1 ; also, the functions achieving equality 
must be Gaussian (which holds in general, cf. Brascamp-Lieb [26]). 

We will formulate now simultaneously the Holder- Young-Brascamp-Lieb inequality in the three 
cases: 

(CI) n{dxj) is normalized Lebesgue measure on the torus [— tt, ir] d , and M has all its coefficients 
integers. 

(C2) n(dxj) is counting measure on Z d , M has all its coefficients integers, and is unimodular, i.e. 

all its non-singular minors of dimension m x m have determinant ±1. 
(C3) /j,(dxj) is Lebesgue measure on (— oo, +cc) d . 

The result below specifies the domain of validity of Holder's inequality, called power counting 
polytope, in terms of linear inequalities involving the rank r(A) of arbitrary subsets A of columns 
of the matrix M (including the empty set 0). It is also possible to express the inequalities in terms 
of the "dual rank" r*(A) defined by a dual matrix M* whose lines are orthogonal to those of M, by 
using the duality relation 

r*(A) = \A\ -r(M) + r(A c ), WL 

Theorem C.l. (Homogeneous Holder- Young-Brascamp-Lieb inequality) . Letlj(x) = x t aj,j = 
l,...,k be linear Junctionals lj : S m — > S where the space S is either the torus [— tt, Tr] d , 7L d , or~R d . 
Let M denote the matrix with columns ctj,j = 1, k, and let r(A),r*(A), denote the rank and dual 
rank of any set A of columns of M. 

Let fj, j = l,...,k be functions fj £ L pj (a(dx)), 1 < pj < oo, defined on S, where a(dx) is 
respectively normalized Lebesgue measure, counting measure and Lebesgue measure. 

Let Zj = -p, j = 1, . . . , k, and z = (z\, z^). The Holder-Young-Brascamp-Lieb inequality (GH) 
will hold (with K = K(z) < oo) throughout the "power counting polytopes" POP defined respectively 



by: 






(cl) 




< r(A), WL 




jeA 




(c2) 




> r(M) -r{A c ), ^(1 - Zj ) < r*{A) MA 




jaA 
k 


jeA 


(cS) 




= m, and one of the conditions (cl) or (c2) is satisfied. 




3=1 





Alternatively, the conditions (cl-c3) in the theorem are respectively equivalent to: 

(1) z = (z±, . . . , Zk) lies in the convex hull of the indicators of the sets of independent columns of 
M , including the void set. 

(2) z = (z±, . . . , z^ lies in the convex hull of the indicators of the sets of columns of M which 
span its range. 

(3) z = (z±, . . . , z^ lies in the convex hull of the indicators of the sets of columns of M which 
form a basis. 

If, moreover, the matrix M is unimodal, then the Holder- Young-Brascamp-Lieb inequality ( GH) 
holds with constant K = 1. 

Notes: 1) Polytopes defined by the type of rank constraints appearing in cases (cl-c2) are called 
"polymatroids" (associated to M and M*) - see Welsh [72], 18.3, Theorem 1. The third polytope is 
the intersection of the first two. 
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2) The first two cases of Theorem I C . 1 1 were obtained for unimodal matrices in [12] and [13], respec- 
tively. 

3) Some further important issues, like the precise formula for K(z), and the nonhomogeneous 
extension where L may be linear operators with possibly different images, were resolved only recently 
- see Lieb [56], Bennett, Carbery, Christ, Tao [20]. 

Proof sketch: By Edmonds theorem (see Welsh [72], 18.4, Theorem 1) the extremal points of the 
above polymatroids have only and 1 coordinates for any matrix M. This fact leads to an easy proof, 
since at the extremal points the result is immediate. For example, in the first and third cases, the 
extremal points are in one to one correspondence with the indicators of independent sets and bases 
A, respectively, and the constant at such a point, by a change of variables, is seen to be: 

K A = y/detiAA*) 

Since K(z) is finite at the extremal points, and Riesz-Thorin interpolation ensures the convexity of 
\og(K(z)), it follows that K{z) will remain finite throughout the polytope generated by the indicators 



Appendix D. Wick products and Appell polynomials 
Let W be a finite set and Yi, i E W be a system of random variables. Let 

Y w =]jY i 

denote the ordinary product, with Y® = 1, let m w = -EHigiy Yi ^ e the (mixed) moment, and let 

x (Y w ) = x (Y,ie W) 

denote the (mixed) cumulant of the variables Yi,i G W, defined recursively as the solutions of the 
equations: 



w 

m 



E x(Y V ^)---x(Y Vr ), (D.l) 
{V}\-W 



where the sum Yl{v}\-W * s over a ^ partitions {V} = (Vi, . . . ,V r ), r > 1 of the set W, and where 
X (Y 9 ) = 1. ' f 

Notes: 1) The equation (jD.ip is the formal power series expression of the "exponential relation 
m = e x " between moments and cumulants, viewed as functions on the lattice of subsets [59]. 

2) The inverse of the equation (ID.ljl . the formal power series expression of the "logarithmic relation 
X = log(m)" may also be computed by: 



X(Y1, • • • , Y n ) = &T log £exp(E ZjYj] 



dz\... dz n 

i=i 



zi="-=z n =0 



where the differentiation is interpreted formally if the moment generating function does not exist. 
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Definition D.l. The Wick products : Y w : are defined as the solutions of the recursion: 

Y w = £ : Y u : E(Y W \ U ) =^:Y U : £ X (Y^) • • • X {Y V % 
ucw ucw {v}\-w\u 

where the sum J2ucW ^ s taken over all subsets U C W, including [7 = 0, the sum J2{v}\-W\u 1S over 
all partitions {V} = (Vi, . . . , V r ), r > 1 of the set W\U, and the starting value is : Y® := 1. 

Notes: 1) Inverting the recursion yields ( [69], Proposition 1): 

■.y w --=E yU E (-i) r x(r Vl )---x(r v -), 

few {v}\-w\u 

as may be formally seen by replacing to -1 by e~ x . 

2) When some variables appear repeatedly, it is convenient to use the notation 

: Yti > ■ ■ • j ^ti j •••) ^t fc ! ■ ■ ■ j Yt k := -P?n,...,n fe (^ti > •••) ^tfc ) 

V v ' V v ' 

ni n k 

(the indices in P correspond to the number of times that the variables in ": :" are repeated). The 
resulting multivariate polynomials P m ,...,n k are known as Appell polynomials. These polynomials are 
a generalization of the Hermite polynomials, which are obtained if are Gaussian, and like them they 
play an important role in the limit theory of quadratic forms of dependent variables (cf. [69], [44], [11]). 
4) The Appell polynomials may also be directly defined by "power- type" recursions like: 

d 

7j Pni,...,n k ■ ■ ■ j x k) = n jPni,...,rij — l,...,n k • • • > ■^Pn\,...,n k (-^1 1 • • • i -^k) = 

y nj > 0, j = 1, ...,k, ^rij > 1, 
3 

Pq,...,o{xi, . . . ,x fc ) = 1. 

For example, when m = n = 1, Pi j i(X(, X s ) = XfX s — EXt X s , and the bilinear form Qt(Pi,i) is a 
weighted periodogram with its expectation removed. 

Note that the multivariate Appell polynomials can be defined by using characteristic functions as 
well (see, e.g., [70]). 



Appendix E. The diagram formula and the moments/cumulants of 

sums/bilinear forms of Wick products 

E.l. The cumulants diagram representation 

An important property of the Wick products is the existence of simple combinatorial rules for 
calculation of the (mixed) cumulants, analogous to the familiar diagrammatic formalism for the mixed 
cumulants of the Hermite polynomials with respect to a Gaussian measure [58] . Let us assume that 
W is a union of (disjoint) subsets W\, . . . , Wf~- If (i, 1), (i, 2), . . . , (i, rii) represent the elements of the 
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subset Wj, i = 1, . . . , k, then we can represent W as a table consisting of rows W\, . . . , Wk, as follows: 

(1,1),..., (i,m) \ 

= W. (E.l) 

(k, 1), . . . , (k,n k ) J 

By a diagram 7 we mean a partition 7 = (Vi, . . . , V r ), r = 1, 2, . . . of the table PF into nonempty sets 
Vi (the "edges" of the diagram) such that \Vi\ > 1. We shall call the edge V* of the diagram 7 /Zai, 
if it is contained in one row of the table W; and free, if it consists of one element, i.e. \Vi\ = 1. We 
shall call the diagram connected, if it does not split the rows of the table W into two or more disjoint 
subsets. We shall call the diagram 7 = (V\, . . . , V r ) Gaussian, if \V\\ = ■ ■ ■ = \V r \ = 2. Suppose given 
a system of random variables Yij indexed by (i,j) € W. Set for V C W, 

y y n y *j> and : yy : = : g y ) : • 

(<J)ev 

For each diagram 7 = (Vi, . . . , V r ) we define the number 

r 

/ 7 = 11x^0- (E-2) 

i=i 

Proposition E.l. (cf. [44], [69]) Each of the numbers 

(i) EY W = E(Y Wl . . . Y Wk ), 

(ii) E(:Y Wl :Y Wk :), 
(in) X (Y W \...,Y W »), 
(iv) X (--Y W > Y w * :) 

is egtia/ to 

where the sum is taken, respectively, over 

(i) all diagrams, 

(ii) all diagrams without flat edges, 
(Hi) all connected diagrams, 

(iv) all connected diagrams without fiat edges. 
If EYij = for all S W, then the diagrams in (i)-(iv) have no singletons. 

Notes: 1) Part (i) is just the exponential relation between moments and cumulants. 
2) From part (ii) follows, for example, that E : Y w := (take W = W±, then W has only 1 row 
and all diagrams have flat edges). 

E.2. Multilinearity 

An important property of Wick products and of cumulants is their multilinearity. For sums and 
bilinear forms 

St = Sf= f P m (X t )u(dt), Q T = Q™' n = f f b(t - s)P m , n {X t ,X s )u(dt)v{ds) 
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this implies that: 

(1) 



Xk(Sr, —, St) = 

I XO X ti,n ■ ■ ■ , X tl:m X tK1 ,...,X tkm :) JJz/(dii), 

JUgIt .•_ 1 



where the cumulant in the integral is taken for a table W of k rows R\....,Rk, each containing 
the Wick product of I variables identically equal to X th . 

(2) 



Xk(Qr, -;Qt) = 

/ x( : -Xfi,i> • • ■ ,X thm , X Sl l ,. . . ,X Sl n :, 

J U,Si£lT 



U,Si£lT 

k 



1=1 



where the cumulant in the integral needs to be taken for a table of k rows R\ , Rfc , 

each containing the Wick product of m variables identically equal to X tk and of n variables 
identically equal to X Sk . 



E.3. The cumulants of sums and quadratic forms of moving average tables 

By part (iv) of Proposition IE.H applied to a table W of k rows Rx....,Rj., with K = ri\ + ■■■Uk 
variables, and by the definition (jE.2[) and of J7, we find the following formula for the cumulants of 
the Wick products of linear variables (|4.7|) : 



X(: X tlA , ... ,X tlni X tkA , ... ,X tknk :) — ^ k 7 J 7 (F), (E.3) 

jer(m,...,n k ) 
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where T(ni, . . . , rife) denotes the set of all connected diagrams 7 = (V\, . . . , V r ) without flat edges of 
the table W, k 7 = x\Vi\(£h) ■ ■ ■ X\v r \(Ch) and 

r 

j 7 (ti, ...,**-) = n j ^(%) ( e - 4 ) 
3=1 

= / ] [ \a(tj,i - Sj,i)a(tj, ni - Sj ini ) . . . 

J si,...,s r e / - =1 

• • • - Sfe,i) . . . a(tfc,n fc - Sfc,n fe )] ^(dsi), • • • , ^(ds r ) 

K r K 



Al '---' A ^ i=l j=l ieVj i=l 



L 



K 



Al ' - ' A ^ j=i \ ieVj I i=i 



where s k ,i = Sj if (fc, i) eV j: j = 1, . . . ,r and A^ = ^ if (j, i) G Vj, j = 1, . . . , r. 

E.4. The cumulants of sums and quadratic forms of moving average processes. 

We will apply now the formula (|E,4p to compute the cumulants of , Q^' n ^ . In this case, each 

row j contains just one, respectively two random variables. 

(2) 

It is easy to check that the variance of Sj. is: 

r 2 

X2 (4 2) ) = 2 / /(A 1 )/(A 2 )A T (A 1 -A 2 )A T (A 2 -A 1 )TT/x(dA e ). 



e=l 



Note that there are only two possible diagrams on a table with two rows of size 2, and that they yield 

both a graph on two vertices (corresponding to the rows), connected one to the other via two edges. 

(2) 

For another example, the third cumulant x^i^T ) * s a sum °^ terms similar to: 

3 

2 2 / /(Ai)/(A 2 )/(A 3 )A T (A 1 - A 2 )A T (A 2 - A 3 )A T (A 3 - Ax) ]J fx(d\ e ). 

This term comes from the 2 2 diagrams in which the row 1 is connected to row 2, 2 to 3 and 3 to 1. 

For quadratic forms, a further application of part (iv) of Proposition IE. II will decompose this as a 
sum of the form 

^ / FLj(ti,Si) Y[b ti - Si dtidsi, 
7 er(n lv ..,n fc ) l ' Sl£/T i=1 
where r(m, . . . ,rik) denotes the set of all connected diagrams 7 = (V\, . . . , V r ) without flat edges of 
the table W and i? 7 (ij,Sj) denotes the product of the cumulants corresponding to the partition sets 
of 7. This easy to check formula is also an illustration of the diagram formula. 
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When m = n = 1 and k = 2, besides the Gaussian diagrams we have also one diagram including all 
the four terms, which makes intervene the fourth order cumulant of Xf. 

When m = n = 1, the Gaussian diagrams are all products of correlations and the symmetry of 
b implies that all these 2 k ~ 1 (k — 1)1 terms are equal. We get thus the well-known formula for the 
cumulants of discrete Gaussian bilinear forms. 

In general, we find decompositions as sums of certain "Fejer graph integrals" , associated to specific 
graph structures. 

The general structure of the intervening graphs for the cumulants of sums St and quadratic forms 
Qt have been discussed in Section 13.11 (see Example 13. 5ft . 
The following proposition is easy to check. 

Proposition E.2. Let Xf,t £ It denote a stationary linear process given by 7[ ) with d= 1. Then, 
the cumulants of the sums and quadratic forms defined in {E.J$ are given respectively by: 



xw = Xfc(4 m) .-. o(m) 



St )= E K 7°7( T ) 
7er(m,fc) 



and 

Xk,m,n = XkiQx \---iQt 

where At(x) is the Fejer kernel, T(l,k), T(m,n,k) were defined above, and 



7Gr(m,n,fc) 



aJT) 



[ n e a «) n a ^ n ^e a *) n dx 

JX 1 ,...,X K j= i i=m(j-i)+i i=l j=l ieVj i=l 



(E.5) 



tJT) 



/J l l,...,(Mj t ,\l ) ...,\ K ,\ 1 ,...,\' Kl 

k 



n 



mj tij 

3=1 [_ ' i=m{j~l)+l i=n(j-l)+l 

K K' r K K' k 

n«(Ai) ii^e^+e^) n^n^n^ 

8=1 i=l 3=1 i&Vj i=l i=l i=l 



(E.6) 



These graph structures are simple enough to allow a quick evaluation of the orders of magnitude 
ajn(z), via the corresponding graph-breaking problems; for the case of bilinear forms we refer to 
Lemma 1 in [14]. 
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For the case of sums, the domain of applicability of the CLT is 1 — z\ > 1/m. We check now that 
at the extremal point 1 — z% = 1 /m we have 

= max[co(G — A) 

= max[co(G — A) 
< k/2, VG G g k , 

where we interpret p(A) as a "profit," equal to the "gain" co(G — A) minus the "cost" YleeA^- ~ Ze )- 
We thus need to show that at the extremal point 1 — z\ = 1/m, 

co(G - A)} < \A\/m + k/2, VG G Q k . 

Indeed, this inequality holds with equality for the "total breaking" A = £ (which contains (km)/2 
edges). It is also clear that no other set of edges A can achieve a bigger "profit" p(A) (defined in 
(|E,7p ) than the total breaking, since for any other set A which leaves some vertex still attached to the 
others, the vertex could be detached from the others with an increase of the number of components 
by 1 and a cost no more than m— ; thus the profit is nondecreasing with respect to the number of 
vertices left unattached and thus the total breaking achieves the maximum of p(A). 



-14(1-*)] 
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